The Biotechnology Systems Branch of the Scientific and Technical Information 
Center (STIC) detected errors when processing the following computer readable 
form: 

Application Serial Number: 09/015,078 



Source: 
Date Processed by STIC: 



TECH CENTER 1600/25. 



THE ATTACHED PRINTOUT EXPLAINS DETECTED ERRORS. 

PLEASE FORWARD THIS INFORMATION TO THE APPLICANT BY EITHER: 

1) INCLUDING A COPY OF THIS PRINTOUT IN YOUR NEXT COMMUNICATION TO THE 
APPLICANT, WITH A NOTICE TO COMPLY or, 

2) TELEPHONING APPLICANT AND FAXING A COPY OF THIS PRINTOUT, WITH A 
NOTICE TO COMPLY 

FOR CRF SUBMISSION QUESTIONS, PLEASE CONTACT MARK SPENCER, 703-308-4212. 

FOR SEQUENCE RULES INTERPRETATION, PLEASE CONTACT ROBERT WAX, 703-308-4216. 
PATENTIN 2.1 e-mail help: patin21help@uspto.gov or phone 703-306-4119 (R. Wax) 
PATENTIN 3.0 e-mail help: patin3help@uspto.gov or phone 703-306-4119 (R. Wax) 

TO REDUCE ERRORED SEQUENCE LISTINGS, PLEASE USE THE CHECKER 
VERSION 3.0 PROGRAM, ACCESSIBLE THROUGH THE U.S. PATENT AND 
TRADEMARK OFFICE WEBSITE. SEE BELOW: 



Checker Version 3.0 

The Checker Version 3.0 application is a state-of-the-art Windows-based software program 
employing a logical and intuitive user interface to check whether a sequence listing is in 
compliance with format and content rules. Checker Version 3.0 works for sequence listings 
generated for the original version of 37 CFR §§1.821 - 1.825 effective October 1, 1990 (old 
rules) and the revised version (new rules) effective July 1, 1998, as well as World Intellectual 
Property Organization (WIPO) Standard ST.25. 

Checker Version 3.0 replaces the previous DOS-based version of Checker, and is 
Y2K-compliant. Checker allows public users to check sequence listings in Computer Readable Form 
(CRF) before submitting them to the United States Patent and Trademark Office (USPTO). 
Use of Checker prior to filing the sequence listing is expected to result in fewer sequence 
listings with errors, thus saving time and money. 
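
As a rough illustration of the kind of content check described above, the following minimal Python sketch compares the LENGTH declared for a nucleotide record with the number of bases actually present, and flags symbols outside the IUPAC nucleotide alphabet. It is not the Checker program and does not reproduce its rules; the function name `check_nucleotide_record` and the choice of accepted symbols are assumptions made for this example only.

```python
import re

# IUPAC nucleotide codes. Which symbols a real checker accepts is an
# assumption here, not something stated in this notice.
IUPAC_BASES = set("ACGTURYKMSWBDHVN")


def check_nucleotide_record(declared_length, sequence_lines):
    """Return a list of problems found in one nucleotide record.

    Hypothetical helper: strips whitespace and digits (block spacing and
    running position counts) and then checks symbols and total length.
    """
    problems = []
    bases = "".join(re.sub(r"[\s\d]", "", line) for line in sequence_lines).upper()
    invalid = sorted(set(bases) - IUPAC_BASES)
    if invalid:
        problems.append("invalid symbols: " + "".join(invalid))
    if len(bases) != declared_length:
        problems.append(f"declared length {declared_length}, counted {len(bases)}")
    return problems


# SEQ ID NO: 1 from the listing in this notice declares 19 base pairs.
print(check_nucleotide_record(19, ["ATGCCNGGNA AAGCARATG 19"]))
```

An empty list means the record passed both checks; each string in a non-empty list describes one discrepancy.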



Checker Version 3.0 can be downloaded from the USPTO website at the following address: 



http://www.uspto.gov/web/offices/pac/checker 
RAW SEQUENCE LISTING DATE: 09/10/2001 

PATENT APPLICATION: US/09/015,078 TIME: 09:39:29 

Input Set : A:\235673-l.app 

Output Set: N:\CRF3\09102001\I015078.raw 

SEQUENCE LISTING 

4 (1) GENERAL INFORMATION: 

6 (i) APPLICANT: Suerbaum, Sebastian 

7 Labigne, Agnes 

9 (ii) TITLE OF INVENTION: Cloning and Characterization of the flbA 

10 Gene of H. Pylori, Production of Aflagellate Strains 

12 (iii) NUMBER OF SEQUENCES: 13 

14 (iv) CORRESPONDENCE ADDRESS: 

15 (A) ADDRESSEE: Finnegan, Henderson, Farabow, Garrett & 

16 Dunner 

17 (B) STREET: 1300 I Street, N.W. 

18 (C) CITY: Washington 

19 (D) STATE: D.C. 

20 (E) COUNTRY: USA 

21 (F) ZIP: 20005-3315 

23 (v) COMPUTER READABLE FORM: 

24 (A) MEDIUM TYPE: Floppy disk 

25 (B) COMPUTER: IBM PC compatible 

26 (C) OPERATING SYSTEM: PC-DOS/MS-DOS 

27 (D) SOFTWARE: PatentIn Release #1.0, Version #1.30 

29 (vi) CURRENT APPLICATION DATA: 

C--> 30 (A) APPLICATION NUMBER: US/09/015,078 

C--> 31 (B) FILING DATE: 23-Aug-2001 

32 (C) CLASSIFICATION: 

34 (viii) ATTORNEY/AGENT INFORMATION: 

35 (A) NAME: Meyers, Kenneth J. 

36 (B) REGISTRATION NUMBER: 25,146 

37 (C) REFERENCE/DOCKET NUMBER: 02356.0073-01000 

39 (ix) TELECOMMUNICATION INFORMATION: 

40 (A) TELEPHONE: (202) 408-4000 

41 (B) TELEFAX: (202) 408-4400 
44 (2) INFORMATION FOR SEQ ID NO: 1: 

46 (i) SEQUENCE CHARACTERISTICS: 

47 (A) LENGTH: 19 base pairs 

48 (B) TYPE: nucleic acid 

49 (C) STRANDEDNESS: single 

50 (D) TOPOLOGY: linear 

52 (ii) MOLECULE TYPE: DNA (genomic) 

57 (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1: 

59 ATGCCNGGNA AAGCARATG 19 
61 (2) INFORMATION FOR SEQ ID NO: 2: 

63 (i) SEQUENCE CHARACTERISTICS: 

64 (A) LENGTH: 18 base pairs 

65 (B) TYPE: nucleic acid 

66 (C) STRANDEDNESS: single 

67 (D) TOPOLOGY: linear 

69 (ii) MOLECULE TYPE: DNA (genomic) 

74 (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 2: 

76 RAAYTTCATN GCNCCRTC 

78 (2) INFORMATION FOR SEQ ID NO: 3: 

80 (i) SEQUENCE CHARACTERISTICS: 

81 (A) LENGTH: 135 base pairs 

82 (B) TYPE: nucleic acid 

83 (C) STRANDEDNESS: single 

84 (D) TOPOLOGY: linear 

86 (ii) MOLECULE TYPE: DNA (genomic) 

91 (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 3: 

93 ATGCCAGGAA AGCAAATGGC GATTGATGCG GATTTAAATT CAGGGCTTAT TGATGATAAG 60 

95 GAAGCTAAAA AACGGCGCGC CGCTCTAAGC CAAGAAGCGG ATTTTTATGG TGCGATGGAT 120 

97 GGCGCGTCTA AATTT 135 
99 (2) INFORMATION FOR SEQ ID NO: 4: 

101 (i) SEQUENCE CHARACTERISTICS: 

102 (A) LENGTH: 28 base pairs 

103 (B) TYPE: nucleic acid 

104 (C) STRANDEDNESS: single 

105 (D) TOPOLOGY: linear 

107 (ii) MOLECULE TYPE: DNA (genomic) 

112 (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 4: 

114 CGGGATCCGT GGTTACTAAT GGTTCTAC 28 

116 (2) INFORMATION FOR SEQ ID NO: 5: 

118 (i) SEQUENCE CHARACTERISTICS: 

119 (A) LENGTH: 28 base pairs 

120 (B) TYPE: nucleic acid 

121 (C) STRANDEDNESS: single 

122 (D) TOPOLOGY: linear 

124 (ii) MOLECULE TYPE: DNA (genomic) 

129 (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 5: 

131 CGGGATCCTC ATGGCCTCTT CAGAGACC 28 

133 (2) INFORMATION FOR SEQ ID NO: 6: 

135 (i) SEQUENCE CHARACTERISTICS: 

136 (A) LENGTH: 2501 base pairs 

137 (B) TYPE: nucleic acid 

138 (C) STRANDEDNESS: single 

139 (D) TOPOLOGY: linear 

141 (ii) MOLECULE TYPE: DNA (genomic) 

146 (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 6: 

148 AGCTTTTTTG TGCCATACTT TTAAACTTTA TATTATAATA AGAGACAAAC ACACCTACCA 60 
150 AAATTAAGGC ATTGATTTTA GATTATGGCA AACGAACGCT CCAAATTAGC TTTTAAAAAG 120 
152 ACTTTCCCTG TCTTTAAACG CTTCTTGCAA TCCAAAGACT TAGCCCTTGT GGTCTTTGTG 180 
154 ATAGCGATTT TAGCGATCAT TATCGTGCCG TTACCGCCTT TTGTGTTGGA TTTTTTACTC 240 
156 ACGATTTCTA TCGCGCTATC GGTGTTGATT ATTTTAATCG GGCTTTATAT TGACAAACCG 300 
158 ACTGATTTTA GCGCTTTCCC CACTTTATTA CTCATTGTAA CCTTATACCG CTTGGCTTTA 360 
160 AATGTCGCCA CCACTAGAAT GATTTTAACC CAAGGCTATA AAGGGCCTAG CGCGGTGAGC 420 
162 ATTATTATCA CGGCGTTTGG GGAATTTAGC GTGAGCGGGA ATTATGTGAT TGGGGCTATT 480 
164 ATCTTTAGTA TTTTAGTGCT GGTGAATTTA TTAGTGGTTA CTAATGGTTC TACTAGGGTT 540 
166 ACTGAAGTTA GGGCGCGATT TGCCCTAGAC GCTATGCCAG GAAAGCAAAT GGCGATTGAT 600 





... AAAAACGGCG CGCCGCTCTA 660 
... CTAAATTTGT CAAAGGCGAT 720 
... GGGGTTTTTT AGTGGGCGTG 780 
... TCACTATCTT AACCATTGGC 840 
... CACGGACCGG TATTGTCGCC 900 
... AGCTCATCAC ACAGCTCACC 960 
... GCTTTTGCAC CATTCCTGGA 1020 
... TATTCATCGC ATGGCTGATT 1080 
... ATTATTTGAG TCAAAAATTC 1140 
... AACCCCACGC CCCCACCACA 1200 
... AAGCCATTGA TGAAGTGTTA 1260 
... ACAGCTTAGC GGACATGAAA 1320 
... AAAAGATAGC GAGCGATTAT 1380 
... AACTCCCCCC AACGCATTAT 1440 
... TGATGCCGGA TAAGTTTTTA 1500 
... TTCCTACTAA AGAGCCGGCT 1560 
... AAGAAGCCAT CATTCAAGGC 1620 
... CCAGCGAATT AGTGAAAAAA 1680 
... TTTTAGAGCG CTTGGCCAAA 1740 
... CCGGTGCGAT CCGATCAGTC 1800 
... TGCTCACTAT TTTAGAAACG 1860 
... TCTTAACCGA ACAAGTGAGG 1920 
... AAGACGGGCG TTTGAAATTT 1980 
... AATTGCGAGA AAATGGCACT 2040 
... TCATTGAAGC GGTCTCTGAA 2100 
... TTTTGATCGT AGAGCCTAAT 2160 
... TTGATGTAAT CGTGCTAAGC 2220 
... GCACGATCCA TATTAACTTT 2280 
... TTTATCACCT TTCACACATT 2340 
... TTTTTAAAAA TATCCAATGC 2400 
... ATGAGATTTT AAACGCGATC 2460 
... A 2501 

232 (2) INFORMATION FOR SEQ ID NO: 7: 

234 (i) SEQUENCE CHARACTERISTICS: 

235 (A) LENGTH: 732 amino acids 

236 (B) TYPE: amino acid 

237 (C) STRANDEDNESS: single 

238 (D) TOPOLOGY: linear 
240 (ii) MOLECULE TYPE: peptide 

245 (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 7: 

247 Met Ala Asn Glu Arg Ser Lys Leu Ala Phe Lys Lys Thr Phe Pro Val 
248 1 5 10 15 

250 Phe Lys Arg Phe Leu Gln Ser Lys Asp Leu Ala Leu Val Val Phe Val 
251 20 25 30 

253 Ile Ala Ile Leu Ala Ile Ile Ile Val Pro Leu Pro Pro Phe Val Leu 
254 35 40 45 

256 Asp Phe Leu Leu Thr Ile Ser Ile Ala Leu Ser Val Leu Ile Ile Leu 
257 50 55 60 

259 Ile Gly Leu Tyr Ile Asp Lys Pro Thr Asp Phe Ser Ala Phe Pro Thr 
260 65 70 75 80 

262 Leu Leu Leu Ile Val Thr Leu Tyr Arg Leu Ala Leu Asn Val Ala Thr 
263 85 90 95 

265 Thr Arg Met Ile Leu Thr Gln Gly Tyr Lys Gly Pro Ser Ala Val Ser 
266 100 105 110 

268 Ile Ile Ile Thr Ala Phe Gly Glu Phe Ser Val Ser Gly Asn Tyr Val 
269 115 120 125 

271 Ile Gly Ala Ile Ile Phe Ser Ile Leu Val Leu Val Asn Leu Leu Val 
272 130 135 140 

274 Val Thr Asn Gly Ser Thr Arg Val Thr Glu Val Arg Ala Arg Phe Ala 
275 145 150 155 160 

277 Leu Asp Ala Met Pro Gly Lys Gln Met Ala Ile Asp Ala Asp Leu Asn 
278 165 170 175 

280 Ser Gly Leu Ile Asp Asp Lys Glu Ala Lys Lys Arg Arg Ala Ala Leu 
281 180 185 190 

283 Ser Gln Glu Ala Asp Phe Tyr Gly Ala Met Asp Gly Ala Ser Lys Phe 
284 195 200 205 

286 Val Lys Gly Asp Ala Ile Ala Ser Ile Ile Ile Thr Leu Ile Asn Ile 
287 210 215 220 

289 Ile Gly Gly Phe Leu Val Gly Val Phe Gln Arg Asp Met Ser Leu Ser 
290 225 230 235 240 

292 Phe Ser Ala Ser Thr Phe Thr Ile Leu Thr Ile Gly Ala Gly Leu Val 
293 245 250 255 

295 Gly Gln Ile Pro Ala Leu Ile Ile Ala Thr Arg Thr Gly Ile Val Ala 
296 260 265 270 

298 Thr Arg Thr Thr Gln Asn Glu Glu Glu Asp Phe Ala Ser Lys Leu Ile 
299 275 280 285 

301 Thr Gln Leu Thr Asn Lys Ser Lys Thr Leu Val Ile Val Gly Ala Ile 
302 290 295 300 

304 Tyr Cys Phe Cys Thr Ile Pro Gly Leu Pro Thr Phe Ser Leu Ala Phe 
305 305 310 315 320 

307 Val Gly Ala Leu Phe Leu Phe Ile Ala Trp Leu Ile Ser Arg Glu Gly 
308 325 330 335 

310 Lys Asp Gly Leu Leu Thr Lys Leu Glu Asn Tyr Leu Ser Gln Lys Phe 
311 340 345 350 

313 Gly Leu Asp Leu Ser Glu Lys Pro His Ser Ser Lys Ile Lys Pro His 
314 355 360 365 

316 Ala Pro Thr Thr Arg Ala Lys Thr Gln Glu Glu Ile Lys Arg Glu Glu 
317 370 375 380 

319 Glu Gln Ala Ile Asp Glu Val Leu Lys Ile Glu Phe Leu Glu Leu Ala 
320 385 390 395 400 

322 Leu Gly Thr Gln Leu Tyr Ser Leu Ala Asp Met Lys Gln Gly Gly Asp 
323 405 410 415 

325 Leu Leu Glu Arg Ile Arg Gly Ile Arg Lys Lys Ile Ala Ser Asp Tyr 
326 420 425 430 

328 Gly Phe Leu Met Pro Gln Ile Arg Ile Arg Asp Asn Leu Gln Leu Pro 
329 435 440 445 

331 Pro Thr His Tyr Glu Ile Lys Leu Lys Gly Ile Val Ile Gly Glu Gly 
332 450 455 460 

334 Met Val Met Pro Asp Lys Phe Leu Ala Met Asn Thr Gly Phe Val Asn 
335 465 470 475 480 

337 Lys Glu Ile Glu Gly Ile Pro Thr Lys Glu Pro Ala Phe Gly Met Asp 
338 485 490 495 

340 Ala Leu Trp Ile Glu Thr Lys Asn Lys Glu Glu Ala Ile Ile Gln Gly 
341 500 505 510 

343 Tyr Thr Ile Ile Asp Pro Ser Thr Val Ile Ala Thr His Thr Ser Glu 
344 515 520 525 

346 Leu Val Lys Lys Tyr Ala Glu Asp Phe Ile Thr Lys Asp Glu Val Lys 
347 530 535 540 

349 Ser Leu Leu Glu Arg Leu Ala Lys Asp Tyr Pro Thr Ile Val Glu Glu 
350 545 550 555 560 

352 Ser Lys Lys Ile Pro Thr Gly Ala Ile Arg Ser Val Leu Gln Ala Leu 
353 565 570 575 

355 Leu His Glu Lys Ile Pro Ile Lys Asp Met Leu Thr Ile Leu Glu Thr 
356 580 585 590 

358 Ile Thr Asp Ile Ala Pro Leu Val Gln Asn Asp Val Asn Ile Leu Thr 
359 595 600 605 

361 Glu Gln Val Arg Ala Arg Leu Ser Arg Val Ile Thr Asn Ala Phe Lys 
362 610 615 620 

364 Ser Glu Asp Gly Arg Leu Lys Phe Leu Thr Phe Ser Thr Asp Ser Glu 
365 625 630 635 640 

367 Gln Phe Leu Leu Asn Lys Leu Arg Glu Asn Gly Thr Ser Lys Ser Leu 
368 645 650 655 

370 Leu Leu Asn Val Gly Glu Leu Gln Lys Leu Ile Glu Ala Val Ser Glu 
371 660 665 670 

373 Glu Ala Met Lys Val Leu Gln Lys Gly Ile Ala Pro Val Ile Leu Ile 
374 675 680 685 

376 Val Glu Pro Asn Leu Arg Lys Ala Leu Ser Asn Gln Met Glu Gln Ala 
377 690 695 700 

379 Arg Ile Asp Val Ile Val Leu Ser His Ala Glu Leu Asp Pro Asn Ser 
380 705 710 715 720 

382 Asn Phe Glu Ala Leu Gly Thr Ile His Ile Asn Phe 
383 725 730 
385 (2) INFORMATION FOR SEQ ID NO: 8: 

387 (i) SEQUENCE CHARACTERISTICS: 

388 (A) LENGTH: 732 amino acids 

389 (B) TYPE: amino acid 

390 (C) STRANDEDNESS: single 

391 (D) TOPOLOGY: linear 
393 (ii) MOLECULE TYPE: peptide 
398 (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 8: 

400 Met Ala Asn Glu Arg Ser Lys Leu Ala Phe Lys Lys Thr Phe Pro Val 
401 1 5 10 15 

403 Phe Lys Arg Phe Leu Gln Ser Lys Asp Leu Ala Leu Val Val Phe Val 
404 20 25 30 

406 Ile Ala Ile Leu Ala Ile Ile Ile Val Pro Leu Pro Pro Phe Val Leu 
407 35 40 45 

409 Asp Phe Leu Leu Thr Ile Ser Ile Ala Leu Ser Val Leu Ile Ile Leu 
VERIFICATION SUMMARY DATE: 09/10/2001 

PATENT APPLICATION: US/09/015,078 TIME: 09:39:30 

Input Set : A:\235673-l.app 

Output Set: N:\CRF3\09102001\I015078.raw 

:220 C: Keyword misspelled or invalid format, [(A) APPLICATION NUMBER : ] 
:220 C: Keyword misspelled or invalid format, [(B) FILING DATE:] 
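
For applicants who want to tabulate such messages, the sketch below shows one way the two summary lines above could be split into a numeric code, a message, and the affected field. The pattern is inferred only from those two lines as they appear here, so the line layout, the meaning of the number, and the helper name `parse_summary_line` are all assumptions rather than a documented STIC format.

```python
import re

# Pattern inferred from the two summary lines in this notice only;
# the real report layout may differ (assumption).
SUMMARY_LINE = re.compile(
    r":(?P<code>\d+)\s+C:\s*(?P<message>.*?),\s*\[(?P<field>.*?)\s*:?\s*\]"
)


def parse_summary_line(line):
    """Split one summary line into code, message, and field (hypothetical helper).

    Returns None when the line does not match the assumed layout.
    """
    match = SUMMARY_LINE.search(line)
    if match is None:
        return None
    return {
        "code": int(match.group("code")),
        "message": match.group("message"),
        "field": match.group("field").strip(),
    }


line = ":220 C: Keyword misspelled or invalid format, [(B) FILING DATE:]"
print(parse_summary_line(line))
```

Both messages in this notice point at mandatory headings under item (vi), CURRENT APPLICATION DATA, which correspond to listing lines 30 and 31.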